IN THE CLAIMS 

Please add new claim 57 and amend claims 1 and 17 as follows. 



1 . (currently amended) A method of identifying molecules for production , wherein 
the molecules ^ represented by concatenated strings, said method comprising: 

i) encoding two or more related biological molecules into a data structure of initial 
character strings to provide a collection of two or more different initial character strings wherein 
each of said biological molecules comprises at least about 10 subunits; 

ii) selecting at least two substrings from said initial character strings; 

iii) concatenating said substrings to form one or more product strings about the same 
length as one or more of the initial character strings; 

iv) adding the product strings to a data structure to populate a data structure of product 
strings; and 

v) determining [whether the product strings have] sequence identities of the product 
strings relative to [of greater than 30% with] at least one initial character string; and 

vi) selecting [for production] one or more product biological molecules for production, 
wherein the one or more product biological molecules correspond [corresponding] to one or 
more of the product strings [determined to have] having greater than 30% sequence identity with 
the at least one initial character string. 



2. (previously amended) The method of claim 1, wherein said encoding comprises 
encoding two or more nucleic acid sequences into said character strings. 

3. (previously amended) The method of claim 2, wherein said two or more nucleic acid 
sequences comprise a nucleic acid sequence encoding a naturally occurring protein. 

4. (previously amended) The method of claim 1, wherein said encoding comprises 
encoding two or more amino acid sequences into said character strings. 

\ 

5. (previously amended) The methoasof claim 4, wherein said one or more amino acid 
sequences comprise an amino acid sequence encoding a naturally occurring protein. 
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6. (original) The method of claim 1, wherein said biological molecules have at least 30% 
sequence identity with each other. 

7. (original) The method of claim 1, wherein said selectin^comprises selecting substrings 
such that the ends of said substrings occur in string regions of about 3 to about 20 characters that 
have higher sequence identity with the corresponding region of another of said initial character 
strings than the overall sequence identity between the same two strings. 

8. (original) The method of claim 1, wherein said selecting comprises selecting substrings 
such that the ends of said substrings occur in predefined motifs of about 4 to about 8 characters. 

9. (original) The method of claim 1, wherein said selectin^and concatenating comprises 
concatenating substrings fi-om two different initial strings such that the concatenation occurs in a 
region of about three to about twenty characters having higher sequence identity between said 
two different initial strings than the overall sequence identity between said two different initial 
strings. 

10. (previously amended) The method of claim 1, wherein said selectin^omprises 
aligning two or more of said initial character strings to maximize pairwise identity between two 
or more substrings of the initial character strings, and selecting a character that is a member of an 
aligned pair for the end of one of the two or more substrings. 

1 1 (canceled) 
ll 

iS* (previously amended). The method of claim 1, wherein said method further comprises 
randomly altering one or more characters of said initial or product character strings. 

-id: (original) The method of claim Vt, wherein said method further comprises randomly 
selecting and altering one or more occurrences of a particular preselected character in said 
character strings. 

J4: (previously amended) The method of claim 1, wherein said encoding, selecting, or 
concatenating is performed on an internet site. 

+5*, (previously amended) The method of claim 1 , wherein said encoding, selecting, or 
concatenating is performed on a server. 
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J^. (previously amended) The method of claim 1, wherein said encoding, selecting, or 
concatenating is performed on a client linked to a network. 

1^+?-. (currently amended) A computer program product on a computer readable media 
comprising computer code that: 

i) encodes two or more biological molecules into initial character strings to provide a 
collection of two or more different initial character strings wherein each of said biological 
molecules comprises at least about ten subunits; 

ii) selects at least two initial substrings from said character strings; 

iii) concatenates said substrings to form one or more product strings about the same 
length as one or more of the initial character strings; 

iv) adds the product strings to a data structure to populate a data structure of product 
strings; and 

v) determines [whether the product strings have] sequence identities [of greater than 
30% with] of the product strings relative to at least one initial character string; and 

vi) selects [for production of] one or more product biological molecules [corresponding] 
for production, wherein the one or more product biological molecules correspond to one or more 
of the product strings [determined to have] having greater than 30% sequence identity with the at 
least one initial character string, 

''^-l* (previously amended) The computer program product of claim -Wf wherein said two 
or more biological molecules are nucleic acid sequences. 

^ \ . C . . 

4-9: (previously amended) The compMer program product of claim 4^7 wherein said two 
or more biological molecules are nucleic acid &Muences of naturally occurring proteins. 

20: (previously amended) The computer program product of claim ^wherein said two 
or more biological molecules are amino acid sequences. 

-2^. (previously amended) The computer program product of claim J-?; wherein said 
biological molecules have at least 30% sequence identity. 

zl ^ 
-22: (previously amended) The computer program product of claim -t77 wherein said 

computer code selects substrings such that the ends of said substrings occur in string regions of 

about three to about twenty characters that have higher sequence identity with a corresponding 
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region of another of said initial character strings than the overall sequence identity between the 
two substrings. 

(previously amended) The computer program product of claim 4^ wherein said 
computer code selects substrings such that the ends of said substrings occur in predefined motifs 
of about 4 to about 8 characters. 

54". (previously amended) The computer program product of claim^^ wherein said 
computer code selects and concatenates substrings fi-om two different initial strings such that the 
concatenation occurs in a region of about three to about twenty characters having higher 
sequence identity between said two different initial strings than the overall sequence identity 
between said two different initial strings. 

(previously amended) The computer program product of claim wherein the 
computer code selects substrings by aligning two or more of said initial character strings to 
maximize pairwise identity between two or more substrings of the character strings, and 
selecting a character that is a member of an aligned pair for the end of one substring. 

26. (canceled) 

-2^ . ^ . . 

-2?'. (previously amended) The computer program product of claim i^, wherem said 

computer code additionally randomly alters one or more characters of said character strings. 

(previously amended) The computer program product of claim^, wherein said 
computer code additionally randomly selects and alters one or more occurrences of a particular 
preselected character in said character strings. 

29. (previously amended) The computer program product of claim 17, wherein said 
computer code is stored on media selected^om the group consisting of magnetic media, optical 
media, optomagnetic media. 

Jh^, (previously amended) The computer program product of claim ¥t; wherein said 
computer code is in dynamic or static memory of a computer. 



^ dthdraSmj^ 



31-44. ( withdrawi 

-45: (previously added) The method of claim 1, wherein the initial character strings of (i) 



are related. 



7 




20 

46. (previously added) The method of claim 1, further comprising physically screening 
the molecule(s) represented by the product strings for one or more desired properties. 

31 

Af, (previously added) The method of claim 1, further comprising determining a 
computationally predicted property for molecules represented by the product strings. 

AS: (previously added) The method of claim 1, wherein the molecules represented by the 
product strings are made in parallel in an array of vessels, 

-49: (previously added) The method of claim 1 , wherein the molecules represented by the 
product strings are made by assembly of ohgonucleotides. 

50: (previously added) The method of claim 1, further comprising testing members of the 
data structure of product strings for a particular property and determining an optimal 
combination of sequences using multi-variate analysis. 

^ 16 _ . 

-5^. (previously added) The computer program product of claim iT, wherein the initial 

r character strings of (i) are related. 



26 16 

-52*; (previously added) The computer program product of clainxi?; wherein the code 

instructs physical screening of the molecule(s) represented by the product strings for one or more 

desired properties. 

(previously added) The computer program product of claim J-?, wherein the code 
instructs determination of a computationally predicted property for molecules represented by the 
product strings. 

3? 16 

(previously added) The computer program product of claim +7, wherein the 

molecules represented by the product strings are made in parallel in an array of vessels. 

(previously added) The computer program product of claim wherein the 
molecules represented by the product strings are made by assembly of oligonucleotides. 

AO . 

(previously added) The computer program product of claim wherein the code tests 

members of the data structure of product strings for a particular property and determines an 

optimal combination of initial sequences using multi-variate analysis. 
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4 ) ^ (new) A method of identifying molecules for production, wherein the molecules 
are represented by concatenated strings, said method comprising: 

i) encoding two or more related biological molecules into a data structure of initial 
character strings to provide a collection of two or more different initial character strings wherein 
each of said biological molecules comprises at least about 10 subunits; 

ii) selecting at least two substrings from said initial character strings; 

iii) concatenating said substrings to form one or more product strings; 

iv) adding the product strings to a data structure to populate a data structure of product 
strings; and 

V) determining whether the product strings have at least a predefined measure of 
similarity with at least one initial character string; and 

vi) selecting one or more product biological molecules for production, wherein the one or 
more product biological molecules correspond to one or more of the product strings determined 
to have greater than the predefined value of sequence identity with at least one initial string. 



